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Observations  on  Multi>Peg  Towers  of  Hanoi 

Introduction 

While  investigating  the  relative  power  of  abstract  storage  models,  a 
subproblem  that  is  a  generalization  of  the  classic  Towers  of  Hanoi  problem 
arose  [Ball  ’92],  [Ball  &  Coxeter  ’74].  The  standard  version  has  been  used  to 
teach  recursive  methods  in  computer  science  [Aho,  Hopcroft  &  Ullman  ’74], 
[Wirth  ’76],  and  recursion  removal  [Partsch  &  Pepper  ’76].  There  have  been  a 
number  of  papers  dealing  with  various  recursive  and  iterative  algorithms 
[Hayes  ’77],  [Buneman  &  Levy  ’80],  [Walsh  ’82],  all  of  which  have  the  same 
sequence  of  moves  [Wood  ’81],  but  require  differing  amounts  of  program  state 
memory.  The  classic  problem  has  also  been  related  to  the  Gray  code  [Gardner 
’72],  [Buneman  &  Levy  ’80],  [Er  ’82]. 

A  generalization  of  the  nature  of  changing  the  starting  configuration  has 
been  investigated  ([Er  ’83]),  which  is  referred  to  in  the  literature  as  the 
Generalized  Towers  of  Hanoi.  In  addition,  restriction  to  movement  in  only  one 
direction  (the  Cyclic  Towers  of  Hanoi  [Atkinson  ’81],  [Walsh  ’83],  [Er  ’84b]  and 
its  generalization  of  restricting  some  disks  to  move  in  one  direction  and  the 
rest  to  move  in  the  other  direction  (the  Colour  Towers  of  Hanoi  [Er  ’84a]  and 
Generalized  Colour  Towers  of  Hanoi  [Er  ’84c])  have  been  studied.  The 
problem  with  tree-like  stacks  instead  of  stacks  has  been  approached 
[Engelfriet  ’81],  [Jurgenson  and  Wood  ’83],  and  is  one  natural  generalization 
of  the  problem.  The  original  problem  can  be  viewed  as  a  unary  tree  in  this 
context.  For  the  Trees  of  Hanoi,  a  number  of  work  pegs  equal  to  the  arity  of 
the  trees  is  required.  Jurgenson  and  Wood  do  ask  the  question  of  how  having 


multiple  sets  of  additional  work  pegs  will  affect  the  solution  as  an  open 
question  at  the  conclusion  of  their  paper. 

However,  solutions  to  the  most  obvious  generalization  of  the  classic 
problem,  that  of  a  different  number  of  pegs,  has  not  appeared  in  published 
literature  to  my  knowledge.  This  generalization  was  first  introduced  to  me  by 
Pat  Hayes  in  late  1983,  at  which  time  Peter  Gacs  did  some  interesting  upper 
bounds  work,  although  that  was  only  written  up  recently  [Gacs  *86].  Gacs 
does  not  not  claim  originality,  rather  independence,  citing  personal 
communications  from  Janos  Kolmos  to  the  effect  that  is  considered 
Hungarian  folklore,  and  that  Imre  Csiszar  had  done  similar  work. 

The  classical  Towers  of  Hanoi  game  involves  three  pegs,  one  of  them  with  a 
pile  of  disks  of  different  sizes  on  it  in  order,  from  the  smallest  at  the  top  to  the 
largest  at  the  bottom.  The  problem  is  to  move  the  tower  of  disks  from  the  peg 
they  are  on  to  another  peg,  moving  one  disk  at  a  time  from  its  peg  to  another 
peg  but  never  placing  a  disk  on  a  smaller  disk. 

This  problem  has  well-known  matching  upper  and  lower  bounds  of  2n  —  1 , 
where  n  is  the  number  of  disks  [Ball  &  Coxeter  *74].  The  algorithm  for  the 
upper  bound  was  the  subject  a  number  of  papers  that  explored  the  amount  of 
storage  necessary  for  the  solution.  As  it  turns  out,  there  is  a  simple  iterative 
solution,  so  the  storage  required  is  constant  [Hayes  *77].  Refer  to  Figure  1  for 
the  (space- wasting)  recursive  algorithm. 

The  Multi-Peg  Towers  of  Hanoi  Game 

The  multi-peg  version  of  the  game  permits  the  use  of  k  work  pegs  instead  of 
the  customary  one.  The  classic  game  has  k  + 2  pegs:  the  source  peg,  the 
destination  peg,  and  one  work  peg.  Here  we  have  k+2  pegs,  the  source,  the 
destination,  and  k  work  pegs.  The  problem  is  basically  the  same,  to  move  the 


MoveTower(n,  s,  d,  w) 

int  n;  /*  number  of  disks  to  move  *J 

jpeg  s,  d,  w;  /*  source,  destination  and  work  peg  */ 

if(n=  =0)  return; 
else  if  (n  =  =  1) 

MoveDisk(s,  d); 
else 
{ 

MoveTower(n  - 1,  s,  w,  d); 

MoveDisk(s,  d); 

MoveTower(n  —  1,  w,  d,  s); 


Fig.  1.  Algoritnm  1,  the  classic  recursive  "three-peg”  algorithm. 
MoveDisk(s,  d)  moves  a  single  disk  from  peg  s  to  peg  d. 


perfectly  ordered  tower  of  disks  from  the  source  to  the  destination  one  disk  at 
a  time,  using  the  work  pegs  as  needed,  and  never  placing  a  disk  on  a  smaller 
one.  Throughout  the  remainder  of  this  paper,  T(n,  k)  will  represent  the  least 
amount  of  time  required  to  move  n  disks  from  the  source  to  the  destination 
using  k  work  pegs. 

Proposition  1:  For  any  n,  k  >  0,  the  time  T(n,  k)  required  to  move  n  disks 
from  the  source  to  the  destination  is  at  least  2n  —  1 . 

Proof: 

The  top  n  — I  disks  must  be  removed  from  the  bottom  disk  before  it  may  be 
moved,  requiring  at  least  n  —  1  moves.  The  bottom  disk  may  then  be  moved  to 
the  destination,  requiring  one  move.  At  least  n—1  more  moves  are  then 
required  to  place  the  smaller  disks  on  the  bottom  disk,  for  a  total  oi2n  —  l. 

13 

This  is  the  lower  bound  to  move  n  disks  in  a  particular  order  to  another 
peg,  in  the  same  order,  regardless  of  what  ordering  constraints  exist,  if  only 
one  disk  may  be  moved  at  a  time  and  the  moves  treat  the  pegs  like  stacks 
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(addition  and  removal  only  from  the  top  of  the  peg).  This  bound  may  be 
achieved  if  there  are  sufficient  work  pegs. 

Proposition  2:  For  any  n  >  0,  ifk^n  —  1  then 

T(n,  k)  =  T(n,  n  —  1)  —  2n—l. 

Proof: 

It  is  easy  to  see  that  the  n—1  smaller  disks  may  be  spread  out  across  the 
n—1  work  pegs  in  time  exactly  n  —  1.  The  total  time  is  then  no  more  than 
T(n,n  —  l)^n  —  l+l  +  n  —  l  —2n  —  l. 

Since  this  matches  the  lower  bound,  it  is  clearly  the  exact  time,  (xj 

This  linear  upper  bound  when  there  are  plenty  of  work  pegs  and  the 
exponential  lower  bound  when  there  is  one  work  peg  demand  investigation 
into  the  behavior  when  there  is  an  intermediate  number  of  work  pegs.  This  is 
easier  said  than  done,  however.  The  lower  bound  when  there  is  but  one  work 
peg  is  relatively  easy  to  establish  because  the  extremely  limited  number  of 
work  pegs  restricts  what  can  possibly  be  done  to  continue  to  make  progress. 
With  additional  pegs  a  lower  bound  has  not  been  found,  although  it  appears 
certain  to  be  exponential  if  the  number  of  pegs  is  constant. 

The  difficulty  posed  by  analysis  of  games  with  additional  pegs  is  that  the 
intermediate  configurations  are  not  as  constrained;  with  only  one  work  peg, 
you  must  build  a  complete  stack  on  the  only  available  peg  in  order  to  free  the 
next  disk.  With  more  pegs,  you  have  the  the  choice  of  building  lots  of  little 
towers  quickly  or  a  few  big  ones  as  intermediate  configurations.  The  relevant 
tradeoffs  appear  to  be : 

(A)  the  time  to  build  the  (intermediate)  tower(s) 


(B)  its  (their)  size 

(C)  the  number  of  towers  to  build 


.*  w. 


SiN 


If  we  choose  the  strategy  of  building  one  "working  stack”  at  a  time,  then  we 
retain  all  but  one  of  the  working  pegs  free  for  future  use.  By  using  the  spread 
and  stack  algorithm  (see  Figure  2)  as  a  subroutine,  we  can  treat  substacks  of 


Spread&Stack(n,  s,  d,  W) 
int  n;  /*  number  of  disks  to  move  *J 
peg  s,  d;  /*  source  and  destination  peg  V 
list  of  pegs  W;  /*  work  pegs  */ 

if(n  =  =0)  return; 
else  if  (n  =  =  1) 

MoveDisk(s,  d); 
else  if  (n  >  card(W)  + 1) 

halt;  /*  not  enough  work  pegs  */ 
else 


wp  =  first(W); 
for(i  =  l;i<n;i+  +) 
{ 

MoveDisk(s,  wp); 
wp  =  nextfwp); 

MoveDisk(s,  d); 
wp  =  firsKW); 
for(i  =  l;i<n;i+  +) 

MoveDisk(wp,  d); 
wp  =  next(wp); 


} 


Fig.  2.  Algorithm  2,  "spread  and  stack”.  MoveDisk(s,  d)  moves  a  single 
disk  from  peg  s  to  peg  d;  cardfW)  is  the  size  of  W,  or  the  number 
of  work  pegs;  next(wp)  returns  ths  next  peg  in  W  after  wp. 


size  k  as  "superdisks”,  and  use  the  standard  (three-peg)  algorithm  on  these. 
We  may  do  this  because  there  will  always  bek  —  1  work  pegs  available  to  move 
the  superdisks  (the  first  superdisk  can  be  of  size  k  + 1 ,  but  for  simplicity’s  sake, 
we  will  ignore  this.)  If  we  have  n  =  m  X  k  disks,  then  we  require  2m  —  l  moves 
of  the  superdisks,  each  requiring  2k  — 1  moves,  or  a  total  of 
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T(n,k)  =  T(n ,  n/m)z  (2k-l)(2m-l) 


=  (2k-l)(2nJk-i)  =  0(k2nJk) 


when  k  =  n/m =0(n). 


Proposition  3  :  If  the  number  of  works  pegs  k^n/fln),  then  the  time 
required  to  move  n  disks  from  the  source  to  the  destination  is  no  more  than 
T(n,  n/f(n))  <.  (2n/f[n)  -  l)(2fCn) -1)  =  0{(n/f(n))2f(n)). 

Proof: 

Given  above,  jx) 

We  achieve  this  same  bound  if  we  use  the  strategy  of  building  k  substacks 
of  size  n/k  using  the  three-peg  algorithm.  Clearly  the  failure  to  use  the  extra 
work  pegs  whenever  possible  wastes  some  of  their  potential.  Still,  this  crude 
upper  bound  gives  us  some  interesting  results  when  we  plug  in  certain 
functions  for  k.  All  three  of  these  corollaries  are  direct  applications  of 
proposition  2. 

Corollary  4 :  If  k  =  log  n,  then 

T(n,  log  n)  =  0 (2*(log  n)/n). 

Corollary  5 :  If  k  =  n°,  where  a  is  a  constant,  then 

T(n,  n°)  =  0(n<*  2nl  ~a). 

Corollary  6 :  If  k  =  n/log  n,  then 

T(n,  n/log  n)  =  O (n2/log  n ). 

Another  approach  to  solving  the  problem  of  space  (the  number  of  work 
pegs)  and  time  tradeoffs  is  to  determine  what  space  is  necessary  for  a  linear 
time  solution.  Clearly,  k  =  n—l  works  pegs  suffice.  Here,  each  disk  moves  at 
most  twice.  What  happens  if  we  constrain  each  disk  to  move  at  most  four 
times?  (Note  that  three  moves  is  too  restrictive,  using  the  observation  that  if 
we  move  a  disk  onto  a  larger  disk  on  a  work  peg,  then  we  will  have  to  move  it 
at  least  twice  more,  once  to  free  the  larger  disk  under  it  and  once  to  put  it  into 
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FourMove(n,  s,  d,  W) 
int  n;  I*  number  of  disks  to  move  *J 
>eg  s,  d;  I*  source  and  destination  peg  */ 
ist  of  pegs  W;  /*  work  pegs  */ 

if(n=  =0)  return; 
else  if  (n  =  =  1) 

MoveDisk(s,  d); 

else  if((sqrt(8n  +  l)  — 3)/2>card(W)) 
halt;  f*  not  enough  work  pegs  */ 
else 
{ 

k  =  card(W); 
wp  =  firstCW); 

WL=W-[wp]; 
for(i  =  l;i<k;i+  +) 

{  /*  must  test  for  end  conditions  here  */ 

Spread&Stack(k  +  2  —  i,  s,  wp,  WL + [d]); 
wp  =  next(wp); 

WL=WL  — [wp]; 

MoveDisk(s,  d); 
wp  =  first(W); 

WL=W-[wp]; 
for(i  =  l;i<k;i  +  +) 

{  /*  must  test  for  end  conditions  here  */ 

Spread&Stack(k  +  2 — i,  s,  wp,  WL  +  [d]); 


wp  =  next(wp); 
WL=WL-[wp]; 


Fig.  3.  Algorithm  3,  "FourMove”.  No  disk  moves  more  than  four  times. 
Subroutines  appear  in  Figure  2. 

place  in  the  final  stack,  for  a  total  of  at  least  four  moves.)  The  best  we  can  do  is 
to  build  towers  using  the  spread  and  stack  algorithm  (using  at  most  two  moves 
apiece),  then  move  the  bottom  disk  to  the  destination  peg,  then  move  the 
towers  in  reverse  order  using  spread  and  stack  to  the  destination  peg.  This 
FourMove  algorithm  is  shown  in  Figure  3.  How  many  disks  can  we  move  in 
this  manner? 


$ 

*!* 


i 

S 

l 
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Let  N(4,  k)  be  the  maximum  number  of  disks  we  can  move  using  at  most 

four  moves  per  disk  and  k  work  pegs.  We  can  build  k  towers  of  size  k  +  1,  k, 

k-lf  ....  2  (each  tower  built  decrements  the  number  of  work  pegs  available 

until  only  the  destination  peg  is  left)  and  then  move  the  bottom  disk  to  the 

destination,  for  a  total  of 

.  .  (*  +  lK*  +  2)  *2+ 3*  +  2 

mt.k)  =  1 - - - —  =  — - — . 


The  total  time  T4(N(4,  k),  k)  taken  is 


i+1  *+1 


r4JV(4,  *),(*)  =  2^  (2i  —  1 ) — 1  =4^i-2^1-l  =  2*(*  +  2)  +  l 


i=l  t=l 


By  solving  for  k  in  terms  of  n,  we  can  determine  that  if 

V  8n+l  -  3 


then 

V  8n+ 1  -  3  . - 

T(n,  k)  =  T(n, - )  =  4n  -  V  8n+ 1  =  O(n) . 

2 

Continuing  in  this  line  of  reasoning,  let  us  see  what  can  be  done  by 
restricting  each  disk  to  move  no  more  than  eight  time.  Using  the  FourMove 
scheme  to  move  as  much  as  possible  to  each  of  the  k  work  pegs,  we  may  move 
(fc+J)(fe+2)/2  disks  to  the  first  peg,  k(k  +  l)/2  to  the  second  work  peg,  ....  3 
disks  to  the  peg.  The  bottom  disk  may  be  moved  to  the  destination  peg, 
and  the  process  reversed.  The  number  of  disks  we  have  moved  is  then 


M8,*)  =  1  +  ^(i+lXi+2V2  = 


*3  +  6*2+  11*  +  6 


with  the  time  required 


rg(JV(i 8, *),*)  =  1  +  2  jr  [2(i+ 1)2  -  l]  = 


2*3  +  9*2+  10*  +  3 


Thus  with  fewer  than  k  =  (n/6)^3  pegs  we  can  move  n  disks  in  time  less 
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The  following  lemma  will  be  used  to  establish  a  general  upper  bound  of  the 


sort  derived  above. 


Lemma  7:  For  integers  n  a  2,  c  2  0, 


(c+l)! 


Proof: 


This  may  be  proven  by  observing  that  it  is  true  for  n  =  1 ,  then  assuming  it 


is  true  for  n<N+l.  In  that  case, 


„  ,  *  ,  +  1  ,  tfc  +  1+(c+l)(AM-l)c 

y  ic  =  (N+l)c+Yic  a  — -+(tf+l)c  =  - — — - 

.  .  .  ,  (c+l)  (c+l) 

1=1  1=1 


Nc  +  1  +  (c+l)(N+l)c  (iV+l)c  +  1 


(c+l) 


(c+l) 


since 


(c+lM‘)  &(*+[), 


because 


(c  +  1 ) o  (c  + 1 ) ^(c_^  yl(c_7),  >  (j+l )Hc_jV 


(c+l)!  (c+l)!  +1 

iKc— /)!  (i+1  )!(c— /)!  J  + 1 


Furthermore,  equality  only  holds  when  c = 0.  [xj 
We  use  this  lemma  to  prove  the  following  theorem. 


Theorem  8  :  If  k  =  (c!n)^c  *»  (c/eln^c  for  some  constant  c,  where  e  is  the 


base  of  the  natural  logarithm,  then  T(n,  k)  <  2cn. 


Proof: 


We  induct  using  the  methods  given  above.  The  base  case  of  c  —  1  is  trivially 


Suppose  that  the  theorem  holds  for  c<C.  For  T(n,  k)<2Cn  moves,  we  (too 
sternly)  insist  that  no  disk  make  more  than  2c  moves  (and  know  that  at  least 


one  disk  will  make  less  than  that,  e.g.:  the  bottom  disk,  which  will  make 


exactly  one  move.)  This  can  be  enforced  by  using  the  strategy  of  building  k 


I 


10 


towers  using  no  more  than  2C-1  moves  per  disk,  then  moving  the  bottom  disk, 
then  reversing  the  tower  building  process  to  stack  those  disks  on  the  bottom 
one.  Clearly,  this  uses  no  more  than  2C  moves  per  disk. 

As  before,  we  will  fix  k  and  c,  then  determine  the  maximum  number  of 
disks  that  can  be  moved  under  this  strategy.  Let  N(c,  k)  be  the  maximum 
number  of  disks  that  can  be  moved  using  no  more  than  2c  moves  per  disk. 

Claim:  N(c,  k)  >  kc/cl 

Proof  of  Claim:  It  has  already  been  shown  for  any  k  and  for  c=2.  Suppose 
it  is  true  for  any  k  and  for  c<C.  Then  by  using  k  work  pegs  to  build  the  first 
tower,  k  —  l  work  pegs  to  build  the  second  and  so  on,  we  can  build  k  towers  of  a 
combined  size 

k  k  -(C-l) 

AT(C,*)>2><C-l,i)>  , 

«=  l  i  =  ilu 


_ 1 _ y  . 

(C— 1)!  T“  * 


^  _ 1 


kC  kC 


(C-l)!  X  C  C! 


This  follows  from  lemma  1  above.  Since  N(C,  k)>kC/C\,  this  means  that  for 
any  number  of  disks  nsN(C,  k),  we  may  move  them  in  2Cn  moves.  In  other 
words, 

kQCl  <  ns  N(C,  k),  so 
kC  <  C!n,  or 
k<{C\n)l/C. 

Certainly,  having  extra  work  pegs  cannot  do  us  harm,  so  if  we  can  get 
away  with  fewer  than  (C\n)UC  work  pegs,  we  can  do  at  least  as  well  with  more. 
0 

Conjecture  :  For  any  average  number  of  moves  per  disk  2"*  (i.e.:  linear 
constant),  and  any  fixed  number  of  work  pegs  k,  there  is  some  number  M(m,k) 
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i 

> 

of  disks  such  that  at  least  2mXM( m,  k)  moves  are  required  to  move  M( m,  k) 
disks  from  one  peg  to  another  peg. 

Peter  Gacs,  crediting  others  with  similar  but  unpublished  discoveries, 
derived  the  following  upper  bounds  with  some  specific  applications. 

Theorem  9  (Gacs) :  Let  msfe  be  a  number  such  that  n;s(m4).  Then 

Tin,  *)S2  m-*+1mm{(Jk*1),  Q}. 

Corollary  10  (Gacs)  :  If  is  small  with  respect  to  n,  so  m=  Tfen^l  +  l, 
then 

.  Vk 

Tin,  k)  S  n2*n  . 

Corollary  11  (Gacs)  :  If  k  is  logarithmic  with  respect  to  n  then  T(n,  k) 
becomes  polynomial  in  n.  Further,  if  tk  =  log  k/k,  and 

k  =  T (zk  +  H2)log  n"l, 

then 

1.5+e. 

Tin,  k)  £  2n  *. 

As  a  measure  of  the  tightness  of  the  upper  bound,  if  k  =  n  —  1  and  m-n ,  the 
inequality  of  the  theorem  gives  T(n,n  —  l)<.4n. 

In  order  to  prove  the  theorem,  the  following  lemma  is  used. 

Lemma  12  (Gacs)  :  For  any  positive  integers,  no,  n;  with  n  =  no  +  ni  we 
have 

T(n,  k)^T(no,  k  —  1)  +2T(ni,k) . 

Proof :  (Gacs) 

■'j 

By  moving  n;  disks  to  one  peg  using  k  work  pegs,  then  moving  no  disks 
to  the  destination  peg  using  k  —  1  work  pegs,  then  moving  the  ni  disks  to 
the  destination  peg  using  k  work  pegs,  we  obtain  the  desired  inequality. 

0 

Proof  of  the  Theorem :  (Gacs) 

We  prove  it  for  n=(mk).  The  estimates  from  the  min  expression  are 


i 

1 


12 


proven  separately,  by  induction  on  m,  starting  with  m  =  k.  At  m  =  k,  we 
have  n=l,  so  any  positive  integer  upper  bound  is  true. 

For  k=l,  we  have  m  =  n,  so  TYn,  k)—2n—l,  so  the  inequality  is  true. 
Suppose  that  the  relation  holds  for  m  —  1  and  that  k>l.  Using  the 
identity 

define 


no  =  (X:i1),ai«fn1  =  (m;1). 


Then  by  the  lemma,  the  inductive  assumption,  the  first  corollary  and  the 
identity,  we  have 

1Xn,k)*T(nQ,k-l)+2T(n1,k)  , 


T(n,k)  <:  2m-*+1(^-^)  +  2x2m-*(”'.1)  =  2m-*+1(^.). 


0 


The  theorem  and  its  corollaries  give  better  results  than  the  naive  approach 
of  proposition  2  for  non-linear  functions,  but  they  are  better  for  k  =  n/c.  The 
upper  bound  when  k  =  {c/e)n^c  is  also  better.  But  what  about  lower  bounds? 
Their  difficulty  has  already  been  alluded  to,  but  what  can  we  really  say  about 
them?  First  we  must  define  a  new  function,  F(n,  t,  k). 

Definition:  Let  F( n,  t,  k)  be  the  number  of  moves  required  to  transfer  n 
disks  from  one  peg  to  t  target  pegs  using  k  additional  work  pegs. 

It  is  easy  to  see  that  T(n,  k)  =  F(n,  1,  k).  Also  obvious  are  the  following 
inequalities. 

(A)  F(n,t,k)  s  F{n,t  —  l,k  +  l) 

(B)  F(n,  t,  k)  s  F(n,  t,k-l ) 

(C)  F(n,  t,  k)  s  mitiij  {F(n  - i,  t - j ,  k  +j)  +  F(i,j,  k)} 

(D)  F(n,  t,  k)  >  F(n  - l,t,k ) 


The  first  of  these  can  be  phrased,  "It  can’t  hurt  to  let  a  work  peg  be  a  target 
peg.”  The  second  can  be  phrased,  "It  can’t  hurt  to  have  an  additional  work 
peg.”  The  third  is  simply  a  statement  that  we  can  do  at  least  as  well  as  the 
obvious  algorithm  of  moving  some  of  the  disks  onto  some  of  the  target  pegs, 
then  moving  the  rest  of  the  disks  onto  the  rest  of  the  target  pegs.  The  last  is 
simply,  "It  is  strictly  worse  if  there  are  more  disks  to  move.”  Really,  the 
inequality  (A)  should  be  strict,  but  this  requires  a  more  subtle  proof.  It  should 
be  pointed  out  that  adding  a  disk  and  a  target  peg,  or  even  a  disk  and  a  work 
peg,  do  not  make  things  worse  in  general  (it  appears  that  in  some  cases  that 
the  time  required  actually  decreases,  witness  F(7,  1,  2)  and  F(8,  1,  3)  for 
example.) 

We  can  use  these  to  show  some  interesting  equalities. 

Theorem  13 :  The  following  equalities  all  hold. 

(1)  F(n,  n,  0)  —  n. 

(2)  F(n,l,n~l)  -2n-l. 

(3)  F(n,l,l)  =  T(n,l)  =  2n-l. 

(4)  F(n,  t,  n—t)  =  2n  —  t,  where  t>0. 

(5)  F(n,  l,k)  =  l  +2F(n-l,  k,  1)  =  T(n,  k). 

(6)  F(n,2,0)  =  F(n-l,l,l)+F(l,l,0)  =2n-l. 

(7)  F(n,t,0)  =  l+F(n-l,t-l,l). 

(8)  F(n,t,k)  =  2n  —  tift>l  and t  +  k>n/2. 

(9)  F(n,  1 ,  k)  =  2(2n —k—2)  +  l,  provided  k  + 1  a  n/2. 

Proof : 

(1  )F(n,  n,  0)  =  n. 


Proof:  obvious.  (X] 
(2)  F(n,  l,n  —  l)  =  2n-l. 
Proof:  Shown  above. 


(3  )F(n,l,l)  =  T(n,l)=2n-1. 

Proof:  Well  known  and  easy.  (x] 

(4 )F(n,t,n—t)  =  2n—t,  where  t>0. 

Proof:  True  for  all  niff =2  orf  =  n.  Iff  =  n,then 

F(n,t,n-t)=F(n,  n,0)  =  n=2n  —  t, 
while  iff =2,  then 

F(n,t,n  —  t)=F(n,l,n  —  l)=2n—l  —2n—t. 

By  the  simple  algorithm  of  spreading  the  n  disks  out  over  the  n 
target  and  work  pegs,  then  moving  the  n—t  smallest  disks  onto  a  target 
peg  (in  largest  to  smallest  order),  we  have 

F(n,t,n-t)  <;  F(n,  n,0)  +  F(n—t,n—t,0)  =  n+fn-f) 

F(n,  t,n  —  t)  s  2n  —  f . 

We  may  spread  n  disks  over  f  >2  target  pegs,  reserving  the  smallest 
disk  by  itself  on  one  peg  in  2n  —  t  moves.  This  can  be  seen  by  the 
application  of  (C),  letting  i  =  t— 1. 

Fin,  t,n—t)  s  F(n—i,  t—i,n—t  +  i)+F(i,  i,n—t) 

<,  F(n-(t-l),l,n—l)+F(t—l,t-l,n-t) 

=  [2(n—t  +  l)—l]+(t  —  l)  =  2n—t, 

The  last  step  uses  (2)  above. 

Therefore,  for  f-2  target  pegs,  we  can  use  this  method  to  move  the  n 
disks  onto  t  pegs  with  the  smallest  disk  on  a  work  peg.  Then  we  move 
the  smallest  disk  from  the  work  peg  onto  one  of  the  the  target  pegs,  so 

F(n,t-l,n  —  t+l)  s F(n,t,n—t)  +  l. 

This  gives  us 

F(n,t,n-t)  ^  F(n,t-l,n-t  +  l)-l. 

Assuming  the  induction  hypothesis  for  t’<  f,  we  have 
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F(n,t,n—t)^2n—(t—l)—l  =  2n  —  t. 

Since  2n  —  tzF(n,t,n—t)^2n—t,  the  hypothesis  holds  for  all  t.  [3 

(5)  F(n,  l,k)  =  1  +2F(n -l,k,  1)  =  T(n,  k). 

Proof:  For  the  ordering  constraint  to  be  observed,  we  must  first  remove 
the  top  n—1  disks  off  the  bottom  disk  and  onto  the  available  pegs, 
leaving  one  free  to  accept  the  bottom  disk,  then  move  the  bottom  disk. 
The  best  we  can  do  to  stack  the  other  disks  on  top  of  the  bottom  disk  is  to 
reverse  the  moves  used  to  spread  them  over  the  k  pegs,  with  appropriate 
substitution  of  pegs.  Otherwise,  we  would  have  a  faster  method  of 
spreading  n—1  disks  over  k  pegs.  The  total  time  is  then  F(n  —  1,  k, 
1)  +  1  +F(n  — l,k,  1),  asdesired.  [3 

(6) F(n,2,0)  =  F(n-l,l,l)+F(l,l,0)  =  2n-l. 

Proof:  The  top  n  —  1  disks  must  be  removed  from  the  bottom  disk  onto 
one  target  peg,  using  the  other  as  a  work  peg.  Only  then  may  the 
bottom  disk  may  be  moved.  (3 
(7  )F(n,t,0)  =  l+F(n-l,t-l,l). 

Proof:  Similar  to  the  proof  of  (6).  [3 

(8)  F(n,  t,  k)  =  2n—t  if  t>l  and  t+k>nJ2. 

Proof:  We  use  equality  (4)  and  inequality  (B), 

2n  —  t  =  F (n,t,n  —  t)  £  F(n,  t,  k). 

Now  we  use  inequality  (C)  and  equality  (4), 

¥(n,t,k)sF(t+k,l,t  +  k—  l)+F(n—(t  +  k),t  —  l,k) 

=  [2(t  +  k)-lJ+[2(n-(t+k))-(t-l)J  =  2n-t. 

Equality  (4)  may  be  applied  to  the  second  term  because 
n—(t  +  k)  s  (t  —  1 )  +  k  —  t  +  k  —  1, 
which  is  true  if  and  only  if 
n  s  2(t  +  k)  —  1, 


; 
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which  is  equivalent  to  our  requirement  that 
n/2  <  t+k.  0 

(9)  F(n,  l,k)  —  2(2n — k  —  2)  +  2 ,  provided  i  +  Ja  n/2. 

Proof:  Using  (5)  we  have 

F(n,l,k)  =  2F(n-l,k,l)  + 1. 

Since  our  requirement  that  k  +  1  2  rx/2  implies  that  k  +  1  >  (n  —  l)/2,  we 
may  apply  (8)  to  the  right  hand  side  to  obtain 
F(n,l,k)=2f2(n-l)-k]+l, 
as  desired.  0 

To  sum  up  these  results,  one  can  say  that  F  is  "well-behaved”  and  "small” 
as  long  as  t+k  >  n/2.  Outside  of  this  range,  it  seems  to  really  blow  up.  Some 
remarkable  things  can  be  seen  to  occur.  First,  note  that  even  when  t  +  k  <  n,  in 
some  cases  it  does  not  help  to  add  an  extra  work  peg  (see  (4)  and  (8)).  Second, 
there  is  a  general  lower  bound  of  4n  -  2k — 3  on  T(n,  k)  given  by  (9). 

Open  Questions 

It  would  be  highly  desirable  to  prove  that  inequality  (C)  is  actually 
equality,  that  inequality  (A)  is  strict  in  certain  ranges  and  that  inequality  (D) 
can  be  strengthened.  Since  (B)  cannot  be  improved  upon,  the  list  should  really 
be 

(A’)  F(n,t,k)<F(n,t  —  l,k  +  l)vrhent+k<n/2-l  or k  =  l 
(B)  F(n,t,k)  s  F(n,t,k-1) 

(C’)  F(n,t,k)  =  minij{F(n-i,t-j,k+j)+F(i,j,k)} 

(D’)  F(n,t,k)  >  F(n  —  l,t,k)  +  l  whenf<n. 

The  last  of  these  has  the  condition  that  t<n  since  when  t=n,  all  that  is 
required  is  to  move  the  additional  disk  onto  the  empty  target  peg. 


A  general  form  for  F(n,  t,  k)  is  still  sought,  or  at  least  bounds  better  than 
those  given  here.  In  particular,  better  understanding  of  the  behavior  of  F(  rt,  t, 
k)  when  k  and  t  are  small  is  necessary.  Finding  tight  general  bounds  for  T(n, 
l,k)  =  T(n,  k)  is  the  most  interesting  open  question.  A  seemingly  obvious  but 
tricky  conjecture  will  be  our  parting  note. 

Coi^jecture :  For  k  constant,  T(n,  k )  is  exponential. 
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Appendix 

Values  for  the  function  F(n,  t,  k)  for  some  small  values  ofn,n£t+k<2. 
Only  the  parameters  are  given,  and  the  columns  roughly  correspond  to 
constant  t  +  k.  The  s  sign  is  used  where  Theorem  13  does  not  cover  and 
inequality  C  is  used. 

(2  2,0)  =2  (2,1,1)— 3 

(32,0)=3  (32,1)=4  (3 2,0)  =4 

(3.12) — 5  (3,1,1)  =  7 

(4,4,0)=4  (42,D  — 3 —(42,0) —  5 

(422)  =  6=(42,1)  =  6  (4 2,0) =8 

(4.12)  — 7  (4,12)  — 9  (4,1,1)  =  15 

(5  2,0) =5  (5,4,1)  —  6  =  (5,4,0)  =  6 

(5,32)=  ...=(52,0)  =  7 
(522)=  ...=(52,1)=8  (52,0)=16 

(5. 1.4)  =  9  (5,12)  =  11  (5,1 2)  =  13  (5,1,1)  =  31 

(6,6,0)  =  6  (62,1)  =  7=(6,5,0)=7 

(6.42) =  ...  =(6,4,0) =8 
(622)=  ...  =(6,3,0)  =  9 

(62.4) =  ...  =(622)  =  10  (62,1)*12 

(6.12)  =  11  (6, 1,4)  =  13  (6,1 2)  =  15  (6,1 2)  =  17  (6, 1,1)  =  63 

(7, 7,0)  =  7  (  7,6,1  )=8=(  7,6,0)  =  8 

(7.52)  =  ...=(7,5,0)=9 

(7.42)  =  ...  =  (7,4,0)  =  10 

(7.3.4)  = ...  =  (7,3,1)  =  11  (72,0)  s  13 

(722)  =  ..  =(722)  =  12  (72,1)*16  (72,0)  =  6< 

(7.1.6)  =  13  (7,1,5)  =  15  (7, 1,4)  =  17  (7,1,3)  =  19  (7.12)*25 
(7,1,1)  =  127 

(8,8,0)=8  (8,7,1)  — 9 =(8,7,0)  =  9 

(8, 6  2)  =  ...  =  (8, 6,0)  =  10 

(8.52) =  .  =  (8, 5,0)  =  11 

(8. 4.4)  =  . ..  =  (8, 4,0)  =  12 
(822)=  .=(8,3,1)  =  13  (8,3,0)*17 

(82.6)  =  ...  =  (82 2)  =  14  (822)*16  (82,1)*20  (82,0)  =  128 

(8.1.7)  =  15  (8, 1,6)  =  17  (8,1 2)  =  19  (8, 1,4)  =  21  (8, 1,3)  =23 

(8,1 2)  *33  (8,1, 1)=255 

(9,9,0)  =  9  (9,8,1)  =  10  =  (9, 8,0)  =  10 

(9, 7  2)  =  .. .  =  (9, 7,0)  =  11 
(9,62)=  .=(9,6, 0)  =  12 

(9.52)  =  ■■■=(92,0)= 13 

(9. 4. 5)  =  ...=(9, 4,0)  =  14 

(9  3  6)—  =(9  3  1)  =  15 

(92.7)  =  . "..  =  (922)  =  16  (922)*20  (92,1)*24  (92,0)  =  256 
(9,1,8)  =  17  (9,1,7)  =  19  (9, 1,6)  =  21  (9, 1,5)  =  23  (9. 1,4)  =  25 

(9,1 ,3)  =  27  (9,1 2) *41  (9,l,l)  =  5ll 


